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Introduction 

In 2008, approximately 1 0.8 million children ages 5—1 7 in 
the United States spoke a language other than English in 
the home (Aud et al., 2010). While most language minor- 
ity students receive all of their instruction in English, 
3.8 million students received English language learner 
services during the 2003-2004 school year (Capps, Fix, 
Murray, Ost, Passel, & Herwantoro, 2005). Compared 
with their native English- speaking peers, language 
minority students on average have lower reading perfor- 
mance in English (August & Shanahan, 2006). While 
numerous factors account for this gap, researchers have 
pointed to differences in word knowledge as part of the 
explanation. Language minority students have both less 
depth (Verhallen & Schoonen, 1993) and less breadth 
of vocabulary. In order to read with clear comprehen- 
sion, students also need to understand the words they 
read, construct an interpretive cognitive model of what 
the author is trying to say, and have the requisite back- 
ground knowledge to categorize, interpret, and remem- 
ber what an author is saying in relation to established 
facts or a field of understanding (such as a content 
area subject). Although a deficit in any of these areas 
may prevent an adolescent reader from comprehending 
grade-level texts, deficits in vocabulary knowledge (and 
in the world knowledge indexed by vocabulary knowl- 
edge) may be the most widely shared problem among 
struggling adolescent readers. 

One way that vocabulary supports reading compre- 
hension is through reading subskills. There is substantial 
evidence that phonological, orthographic, and semantic 
processing of words are interrelated, such that vocabu- 
lary knowledge predicts rates of word reading (Nation & 
Snowling, 2004), and students with better semantic abili- 


ties also have advantages in orthographic identification 
tasks (Yang & Perfetti, 2006). Vocabulary knowledge 
also plays an important role in students’ higher order 
comprehension abilities. Established as well as current 
models of reading comprehension argue that word 
meaning and form selection are critical to creating a situ- 
ation model from text and for integrating new knowl- 
edge from the text with prior background knowledge 
(Kintsch, 1986; Perfetti, Landi, & Oakhill, 2005). Recent 
reviews of research on adolescent literacy demonstrate 
that these higher order processes are exactly where 
most struggling adolescent readers break down (Kamil, 
2003; Rand Reading Study Group, 2002); thus it is not 
surprising that vocabulary scores show increasingly 
strong correlations with reading comprehension scores 
as students move from primary to middle and secondary 
grades (Snow, Porche, Tabors, & Harris, 2007). 

A challenging subdomain of vocabulary knowledge 
acquisition and instruction is all-purpose academic 
vocabulary, a segment of the lexicon that becomes 
particularly relevant to comprehension in adolescence. 
All-purpose academic vocabulary is a category with 
somewhat fuzzy boundaries, but prototypical members 
are words used for making fine distinctions in referring 
to communicative intents (e.g., affirm, confirm ), argumen- 
tation (e.g., evidence, conclusion, warrant ), abstract entities 
(e.g., theory, factor, process), and categories (e.g., vehicle, uten- 
sil, artifact). All-purpose academic words are used across 
content areas, occur frequently in glossaries where 
content-area words are defined, and receive little explicit 
instructional attention precisely because they are not 
seen as the responsibility of any content-area teacher. 
Yet control over this segment of the lexicon is crucial 
to comprehending and producing academic language. 
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This is the vocabulary domain on which the research 
program described in this brief focuses. 

By the time normally developing children enter middle 
school, most will have mastered thousands of words for 
oral use, but comprehension of the rich language of 
text requires an understanding of more and different 
words (Nation, 2006). In middle school, students begin 
to take core subject area classes and are expected to 
read and understand expository texts with increasingly 
difficult vocabulary demands (Gardner, 2004). Clearly, 
exposure to new words in texts is one of the primary 
vehicles for word learning (Nagy & Anderson, 1984; 
Nagy & Herman, 1987). However, there are differences 
in students’ abilities to learn new words incidentally 
while reading; these differences relate to their concur- 
rent vocabulary levels (McKeown, 1985) and to their 
comprehension levels (Swanborn & de Glopper, 2002). 
Without instruction and support, independent read- 
ing is unlikely to improve word-learning outcomes for 
students of low socioeconomic status, although highly 
skilled readers may benefit (Lawrence, 2009). 

Given the evidence that reading comprehension 
supports vocabulary development and that vocabulary 
development supports reading comprehension, we can 
describe the relationship between these two processes 
as one of reciprocal causation. It has been widely noted 
that less able students are likely to fall farther and farther 
behind if they struggle with learning processes linked 
by reciprocal causation (Stanovich, 1986). Fortunately, 
there is evidence that vocabulary instruction can have an 
important and lasting impact on student word learning 
(Beck, Perfetti, & McKeown, 1982; Carlo et al., 2004). 
There is reason to think, then, that a robust vocabulary 
intervention that targets academic language may improve 
vocabulary and reading comprehension in the short run 
while also supporting the struggling reader’s facility at 
learning new words independently. The research proj- 
ect described here presents findings from an unmatched 
quasi-experiment of the Word Generation Program, 
an intervention firmly grounded in what is currently 
known about effective practice, while also casting light 
on how enhanced vocabulary levels relate to improved 
reading comprehension. While findings from a quasi- 
experiment are not firm grounds for causal inference, 
the data here are suggestive and form the basis for our 
ongoing randomized trial. The actual program itself can 
be downloaded for free atwww.serpinstitute.org/word- 
generation. More information about how the program 
was created and how words were selected is also avail- 


able on the Web site and in published studies (Lawrence, 
White, & Snow, 2010; Snow, Lawrence, & White, 2009). 

Program Implementation 

The Word Generation materials define a list of key 
elements that are used to organize instruction. Those 
elements include the following: 

• Monday launch: Reading a paragraph aloud with 
students that introduces a civic dilemma, model- 
ing comprehension processes and word inferencing 
during reading, guiding discussion through compre- 
hension questions, highlighting focus words, elic- 
iting student opinions on the controversy of the 
week. This is usually done in English class. 

• Math activity: Recurrently using target vocabulary 
in all-purpose and (if applicable) math-specific ways, 
engaging students in discussion of math problems, 
reminding students of controversy and soliciting 
their thinking about it, revoicing student comments 
to model clarity and target word use. 

• Science activity: Recurrently using target vocabu- 
lary in all-purpose and (if applicable) science-specific 
ways, linking topic of the week to science content, 
reminding students of controversy and soliciting 
their thinking about it, revoicing student comments 
to model clarity and target word use. 

• Social studies activity: Recurrently using target 
vocabulary in all-purpose and (if applicable) social 
studies specific ways, structuring a debate format, 
giving all students a chance to participate in debate, 
revoicing student comments to model clarity and 
target word use. 

• Friday writing: Reviewing the controversy and 
reading the prompt aloud, reminding students to 
reread the paragraph or their notes to make good 
arguments for their point of view, having the target 
words posted or written on the board, ensuring quiet 
and order so students can write uninterruptedly. 

Quasi-Experiment of Word 
Generation in Partnership With Boston 
Public Schools 

In 2007, our research team began a quasi-experimental 
study in which academic word learning by students in five 
schools implementing the Word Generation Program 
was compared to academic word-learning by students 
in three schools within the same system that did not 
choose to implement the program. Because the imple- 
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menting schools were those that volunteered for the 
program, selection effects must be taken into account 
in interpreting the findings. The results presented here 
have been described in greater depth elsewhere (Snow, 
Lawrence, & White, 2009). 

Participants and Setting 
Schools 

At the start of the study, the average scores of the inter- 
vention schools on the state accountability assessment 
were lower than those of the comparison schools (mean 
of 56% failing in the treatment schools compared 
with 45% failing in the comparison schools). This is 
not surprising; the participating schools volunteered 
to do so, and those with lower scores were more likely 
to show an interest. We do not have detailed informa- 
tion about the vocabulary instruction in the comparison 
schools. Through limited observation and interviews we 
know that there is discipline-specific vocabulary being 
taught in each of the comparison schools as required 
by that state’s curriculum framework. Furthermore, in 
one school, a long-time literacy coach had coordinated 
vocabulary instruction to some extent through a school- 
wide word-of-the-week effort. That being said, none 
of the comparison schools was using a commercial 
vocabulary program, nor were any heavily invested in a 
school-wide approach to vocabulary instruction. 

Students 

All students in the treatment schools received the 
intervention; both pre- and posttest data were avail- 
able on 697 sixth-, seventh-, and eighth-grade students 
(349 girls and 348 boys) in the five treatment schools and 
319 students (162 girls and 157 boys) across the three 
comparison schools. Of these, 438 students were clas- 
sified as language minority (i.e., students whose parents 
reported preferring to receive materials in a language 
other than English); 287 in treatment schools and 151 
in comparison schools. The vast majority of students 
in both treatment and comparison schools were low- 
income. 

Research Design 

Data Collection and Analysis 

The efficacy of the intervention was assessed using a 
48-item multiple choice test that sampled words from 
throughout the year. A high proportion of students 
failed to complete the vocabulary assessments in the time 
available. Because items at the end of the assessment had 
particularly low rates of completion, we dropped the last 
four items from our analysis of both pre- and posttest. 


The reliability of the test with the 40 items that remained 
was acceptable (Cronbach’s alpha = .876). 

This instrument was administered to students in all 
of the treatment schools in October 2007, before the 
introduction of Word Generation materials. Because of 
difficulty recruiting the comparison schools, the pretest 
was not administered at these schools until January. The 
posttest (identical to the pretest except for the order of 
items) was administered in all of the schools in late May. 
Because of the unfortunate disparity in interval between 
pre- and posttesting in the two groups of schools, we 
analyzed words learned per month as well as total words 
learned. 

In addition to this curriculum-based assessment, 
we had access to most students’ spring 2008 scores on 
the Massachusetts Comprehensive Assessment System 
(MCAS) English Language Arts. Additionally, we had 
Group Reading and Diagnostic Evaluation (GRADE; 
Williams, 2000) for both spring and fall for a selection 
of students in comparison (n — 133) and treatment 
(n — 256) schools. These scores were provided by the 
district for all the students for whom data were avail- 
able. The decision to administer the GRADE was made 
at the school and classroom level. Thus, while these data 
are far from complete, we have no reason to think that 
there was a particular sampling bias across the schools. 

Findings 

Descriptive statistics show that students in the Word 
Generation Program learned approximately the number 
of words that differentiated eighth from sixth graders on 
the pretest. In other words, participation in 20-22 weeks 
of the curriculum was equivalent to two years of inci- 
dental learning. Unfortunately, the relative improvements 
in the Word Generation schools will be exaggerated 
by the differences in timing of the pretest. In order to 
account for the differences in test administration times, 
the pre-to-post improvement in all schools was divided 
by the number of months between the pre- and posttest 
administrations: the average improvement per months 
in the treatment schools was greater than that in the 
comparison schools. The average effect size of program 
participation on the researcher-developed vocabulary 
assessment was 0.49 (controlling for the improvement 
attained in the comparison schools). 

Regression analysis was used to determine if partic- 
ipation in Word Generation predicted improved 
vocabulary outcomes, controlling for the pretest. Boys 
learned more words than girls (J3 = -0.052, p < .007) 
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Figure 1 . Vocabulary improvement for English-only and language minority students in Word Generation and 
comparison schools. 


and participants in the program learned more words 
than nonparticipants (5 — 0.166, p < .001). Language 
status (language minority versus English only) was not 
a significant independent predictor of word learning, 
but language minority students learned words at a rela- 
tively faster rate than English-only students in treatment 
schools, but not comparison schools (language status 
interacted with treatment at the margin of significance, 
p — .OSS); including the interaction improved the overall 
model. Interestingly, student pretest vocabulary did not 
interact with treatment in predicting posttest scores. It 
was decided to split the data set to investigate the home 
language variable more closely. The first set of regres- 
sions used pretests and gender to predict posttest scores 
in the comparison schools (r = .62) and Word Genera- 
tion schools (f — .64). In Word Generation schools 
language minority status predicted improved vocabulary 
(fl = -0.053, = .022), but it was not a significant predic- 
tor in comparison schools. These results are demon- 
strated in Figure 1: In the comparison schools (the 
light lines) English-only students improved more than 
language minority students, in the treatment schools 
(bold lines) language-minority students improved more 
than English-only students. 

In order to determine whether participation in Word 
Generation had any relationship to performance on the 


MCAS, a regression model was fit with MCAS scores 
in April 2008 as the outcome; gender, treatment status, 
pretest, and posttest scores were used as predictors. 
We added an interaction term to see if posttest scores 
on the curriculum-based assessment interacted with 
treatment in predicting MCAS scores (controlling for 
pretest scores). The interaction term was significant 
(fi — .21, p — .01) and its inclusion improved the model. 
In other words, students who benefited most from 
participation in Word Generation had higher MCAS 
scores than students with similarly improved vocabular- 
ies acquired without Word Generation exposure. 

We further explored the interaction between treat- 
ment and vocabulary improvement by splitting the data 
and refitting the models to data from the treatment and 
comparison schools separately. The fitted model for 
comparison school data did not predict MCAS achieve- 
ment (R 2 = .41) as well as the fitted model for the treatment 
school data ( R 2 = .49). In the Word Generation schools 
student vocabulary posttest scores (fi — 0.527,/) < .001) 
were much stronger predictors of MCAS achievement 
than pretest scores were (5 = 0.201, /> < .001), perhaps 
because the posttest scores captured not only target 
vocabulary knowledge at the end of the year, but also 
level of student participation in the Word Generation 
program. These conclusions maintained even when we 
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used available baseline GRADE data as a covariate in 
our models. 

Longitudinal Follow-Up on 
Quasi-Experiment 

The goal of Word Generation is to improve vocabulary 
so that it results in improved reading comprehension; 
clearly, short-term vocabulary learning will not generate 
long-term comprehension improvement. Despite the 
evidence of vocabulary gains for all Word Generation 
participants on average, and in particular for language 
minority participants, we did not know whether these 
students maintained vocabulary knowledge after 
summer vacation and through the following school year. 
We conducted a follow-up longitudinal study to exam- 
ine the effects of Word Generation on the learning, 
maintenance, and consolidation of academic vocabu- 
lary for students from English-speaking homes, profi- 
cient English speakers from language minority homes, 
and limited English proficient students. U sing individual 
growth modeling, we found that students receiving Word 
Generation improved on average on target words during 
the instructional period. We confirmed that there was an 
interaction between instruction and language status such 
that English-proficient students from language minority 
homes improved more than English-proficient students 
from English-speaking homes. We administered follow- 
up assessments in the fall after the instructional period 
ended and the spring of the following year to determine 
how well students maintained and consolidated target 
academic words. Students who participated in the inter- 
vention maintained their relative improvements at both 
follow-up assessments (Lawrence, Capotosto, Branum- 
Martin, White, & Snow, 2010). We thus have reason to 
expect that these students will display improved read- 
ing comprehension and enhanced academic learning. A 
randomized experimental study of Word Generation 
now underway will enable us to test this expectation 
more rigorously. 
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